Goto

Collaborating Authors

 information overload


MeSH: Memory-as-State-Highways for Recursive Transformers

Yu, Chengting, Shu, Xiaobo, Wang, Yadao, Zhang, Yizhen, Wu, Haoyi, Li, Jiaang, Long, Rujiao, Chen, Ziheng, Xu, Yuchi, Su, Wenbo, Zheng, Bo

arXiv.org Artificial Intelligence

Recursive transformers reuse parameters and iterate over hidden states multiple times, decoupling compute depth from parameter depth. However, under matched compute, recursive models with fewer parameters often lag behind non-recursive counterparts. By probing hidden states, we trace this performance gap to two primary bottlenecks: undifferentiated computation, where the core is forced to adopt a similar computational pattern at every iteration, and information overload, where long-lived and transient information must coexist in a single hidden state. To address the issues, we introduce a Memory-as-State-Highways (MeSH) scheme, which externalizes state management into an explicit memory buffer and employs lightweight routers to dynamically diversify computation across iterations. Probing visualizations confirm that MeSH successfully resolves the pathologies by inducing functional specialization across iterations. On the Pythia suite (160M-1.4B), MeSH-enhanced recursive transformers consistently improve over recursive baselines and outperforms its larger non-recursive counterpart at the 1.4B scale, improving average downstream accuracy by +1.06% with 33% fewer non-embedding parameters. Our analysis establishes MeSH as a scalable and principled architecture for building stronger recursive models.


Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP

Sovrano, Francesco

arXiv.org Artificial Intelligence

Large language models (LLMs) can amplify misinformation, undermining societal goals like the UN SDGs. We study three documented drivers of misinformation (valence framing, information overload, and oversimplification) which are often shaped by one's default beliefs. Building on evidence that LLMs encode such defaults (e.g., "joy is positive," "math is complex") and can act as "bags of heuristics," we ask: can general belief-driven heuristics behind misinformative behaviour be recovered from LLMs as clear rules? A key obstacle is that global rule-extraction methods in explainable AI (XAI) are built for numerical inputs/outputs, not text. We address this by eliciting global LLM beliefs and mapping them to numerical scores via statistically reliable abstractions, thereby enabling off-the-shelf global XAI to detect belief-related heuristics in LLMs. To obtain ground truth, we hard-code bias-inducing nonlinear heuristics of increasing complexity (univariate, conjunctive, nonconvex) into popular LLMs (ChatGPT and Llama) via system instructions. This way, we find that RuleFit under-detects non-univariate biases, while global SHAP better approximates conjunctive ones but does not yield actionable rules. To bridge this gap, we propose RuleSHAP, a rule-extraction algorithm that couples global SHAP-value aggregations with rule induction to better capture non-univariate bias, improving heuristics detection over RuleFit by +94% (MRR@1) on average. Our results provide a practical pathway for revealing belief-driven biases in LLMs.


Mitigating Clinician Information Overload: Generative AI for Integrated EHR and RPM Data Analysis

Shetgaonkar, Ankit, Pradhan, Dipen, Arora, Lakshit, Girija, Sanjay Surendranath, Kapoor, Shashank, Raj, Aman

arXiv.org Artificial Intelligence

Generative Artificial Intelligence (GenAI), particularly Large Language Models (LLMs), offer powerful capabilities for interpreting the complex data landscape in healthcare. In this paper, we present a comprehensive overview of the capabilities, requirements and applications of GenAI for deriving clinical insights and improving clinical efficiency. We first provide some background on the forms and sources of patient data, namely real-time Remote Patient Monitoring (RPM) streams and traditional Electronic Health Records (EHRs). The sheer volume and heterogeneity of this combined data present significant challenges to clinicians and contribute to information overload. In addition, we explore the potential of LLM-powered applications for improving clinical efficiency. These applications can enhance navigation of longitudinal patient data and provide actionable clinical decision support through natural language dialogue. We discuss the opportunities this presents for streamlining clinician workflows and personalizing care, alongside critical challenges such as data integration complexity, ensuring data quality and RPM data reliability, maintaining patient privacy, validating AI outputs for clinical safety, mitigating bias, and ensuring clinical acceptance. We believe this work represents the first summarization of GenAI techniques for managing clinician data overload due to combined RPM / EHR data complexities.


The Statistical Validation of Innovation Lens

Radaelli, Giacomo, Lynch, Jonah

arXiv.org Artificial Intelligence

Information overload and the rapid pace of scientific advancement make it increasingly difficult to evaluate and allocate resources to new research proposals. Is there a structure to scientific discovery that could inform such decisions? We present statistical evidence for such structure, by training a classifier that successfully predicts high-citation research papers between 2010-2024 in the Computer Science, Physics, and PubMed domains.


Explainable AI Components for Narrative Map Extraction

Keith, Brian, German, Fausto, Krokos, Eric, Joseph, Sarah, North, Chris

arXiv.org Artificial Intelligence

As narrative extraction systems grow in complexity, establishing user trust through interpretable and explainable outputs becomes increasingly critical. This paper presents an evaluation of an Explainable Artificial Intelligence (XAI) system for narrative map extraction that provides meaningful explanations across multiple levels of abstraction. Our system integrates explanations based on topical clusters for low-level document relationships, connection explanations for event relationships, and high-level structure explanations for overall narrative patterns. In particular, we evaluate the XAI system through a user study involving 10 participants that examined narratives from the 2021 Cuban protests. The analysis of results demonstrates that participants using the explanations made the users trust in the system's decisions, with connection explanations and important event detection proving particularly effective at building user confidence. Survey responses indicate that the multi-level explanation approach helped users develop appropriate trust in the system's narrative extraction capabilities.


Ext2Gen: Alignment through Unified Extraction and Generation for Robust Retrieval-Augmented Generation

Song, Hwanjun, Choi, Jeonghwan, Kim, Minseok

arXiv.org Artificial Intelligence

RAG has proven its effectiveness in reducing hallucinations We go beyond accurate retrieval to emphasize in LLMs, when their knowledge is incomplete, robust generation that remains resilient to forgetting outdated, or lacks sufficient detail to accurately and distraction by the two challenges. Our key address specific queries (Gao et al., 2023b; idea for enhancing robustness is an extract-thengenerate Fan et al., 2024). A critical aspect of RAG is the approach, Ext2Gen, where the model "retrieval" process, which involves identifying and first extracts query-relevant sentences from the retrieved selecting relevant text chunks. The quality of these chunks and then refine the information to retrieved chunks plays a pivotal role in the overall generate a precise answer. The extraction step here performance of RAG, as they form the basis serves as a chain-of-thought (CoT) process (Wei for generating factual and contextually relevant answers et al., 2022; Chu et al., 2023), where the model provides aligned with the query intent (Asai et al., the evidence first before generating the final 2024; Wang et al., 2023; Zhang et al., 2024).


LLM-Measure: Generating Valid, Consistent, and Reproducible Text-Based Measures for Social Science Research

Yang, Yi, Duan, Hanyu, Liu, Jiaxin, Tam, Kar Yan

arXiv.org Artificial Intelligence

The increasing use of text as data in social science research necessitates the development of valid, consistent, reproducible, and efficient methods for generating text-based concept measures. This paper presents a novel method that leverages the internal hidden states of large language models (LLMs) to generate these concept measures. Specifically, the proposed method learns a concept vector that captures how the LLM internally represents the target concept, then estimates the concept value for text data by projecting the text's LLM hidden states onto the concept vector. Three replication studies demonstrate the method's effectiveness in producing highly valid, consistent, and reproducible text-based measures across various social science research contexts, highlighting its potential as a valuable tool for the research community.


Intelligent Classification and Personalized Recommendation of E-commerce Products Based on Machine Learning

Xu, Kangming, Zhou, Huiming, Zheng, Haotian, Zhu, Mingwei, Xin, Qi

arXiv.org Artificial Intelligence

With the rapid evolution of the Internet and the exponential proliferation of information, users encounter information overload and the conundrum of choice. Personalized recommendation systems play a pivotal role in alleviating this burden by aiding users in filtering and selecting information tailored to their preferences and requirements. Such systems not only enhance user experience and satisfaction but also furnish opportunities for businesses and platforms to augment user engagement, sales, and advertising efficacy.This paper undertakes a comparative analysis between the operational mechanisms of traditional e-commerce commodity classification systems and personalized recommendation systems. It delineates the significance and application of personalized recommendation systems across e-commerce, content information, and media domains. Furthermore, it delves into the challenges confronting personalized recommendation systems in e-commerce, including data privacy, algorithmic bias, scalability, and the cold start problem. Strategies to address these challenges are elucidated.Subsequently, the paper outlines a personalized recommendation system leveraging the BERT model and nearest neighbor algorithm, specifically tailored to address the exigencies of the eBay e-commerce platform. The efficacy of this recommendation system is substantiated through manual evaluation, and a practical application operational guide and structured output recommendation results are furnished to ensure the system's operability and scalability.


Deep R Programming

Gagolewski, Marek

arXiv.org Artificial Intelligence

Deep R Programming is a comprehensive and in-depth introductory course on one of the most popular languages for data science. It equips ambitious students, professionals, and researchers with the knowledge and skills to become independent users of this potent environment so that they can tackle any problem related to data wrangling and analytics, numerical computing, statistics, and machine learning. This textbook is a non-profit project. Its online and PDF versions are freely available at .


How To Stay on Top of the Latest AI Research

#artificialintelligence

Artificial Intelligence (AI) is a disruptive and fast-moving field whose developmental trajectory is accelerating rapidly. In fact, the number of publications in this space has been rising dramatically in recent years. Stanford's annual Artificial Intelligence Index Report shows that the number of AI publications has increased from 162,444 in 2010 to 334,497 in 2021 [1]. If you are working in the field of AI, you have probably also noticed the shortening intervals between major industry advances such as OpenAI's DALL·E 2, GPT-3 and ChatGPT, or DeepMind's AlphaFold. Those are just some examples that captured the attention of both the general public and the tech industry as they were extensively reported on and widely circulated on social media.